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We compare the noise in interferometric measurements of the Vela pulsar 
from ground- and space-based antennas with theoretical predictions. The noise 
depends on both the flux density and the interferometric phase of the source. 
Because the Vela pulsar is bright and scintillating, these comparisons extend 
into both the low and high signal-to-noise regimes. Furthermore, our diversity 
of baselines explores the full range of variation in interferometric phase. We 
find excellent agreement between theoretical expectations and our estimates of 
noise among samples within the characteristic scintillation scales. Namely, the 
noise is drawn from an elliptical Gaussian distribution in the complex plane, 
centered on the signal. The major axis, aligned with the signal phase, varies 
quadratically with the signal, while the minor axis, at quadrature, varies with 
the same linear coefficients. For weak signal, the noise approaches a circular 
Gaussian distribution. Both the variance and covariance of the noise are also 
affected by artifacts of digitization and correlation. In particular, we show that 
gating introduces correlations between nearby spectral channels. 

Subject headings: methods: data analysis - techniques - stars: pulsars - pulsars: 
individual: Vela pulsar - interstellar scattering 



INTRODUCTION 



Radioa stronomical observations yield a dete rministic part, the signal; and a random 
part, noise ( [Thompson. Moran. &: Swensonl Il986l ). Consequently, the signal-to-noise ratio, 
the magnitude of the deterministic part divided by the standard deviation of the random 
part, characterizes them. An understanding of the noise is fundamentally important because 
it provides a measure of the possibility of detecting a weak signal, and of the reliability of the 
measurements of a detected signal. The noise is particularly important in situations where 
the signal varies, because the noise can mimic the signal. 
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interferometric visibility, the variance of the noise increases quadratically with the signal in 
phase with the signal, and linearly with the signal at quadrature to the signal. The constant 
and linear coefficients in these two directions are equal. 
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We tested this picture for the cross-power spectrum of a scintillating source - the Vela 
pulsar. These observations provided an ideal laboratory for such studies because the pulsar 
varies greatly in flux de nsity and interfero metric phase with frequency and time, because of 
interstellar scintillation (IDesai et al.lll992l ). Thus, each spectrum spanned many scintillation 
elements. Furthermore, the signal-to-noise ratio of the strongest spectral peaks can be high, 
even for short integrations, and the source contributes significantly to system temperature, 
so that self-noise is important. 

We compared predictions with observational estimates, formed by differencing samples 
close together in time and binning them according to their estimated average visibility. This 
procedure provides for conve nient visualization of the noise distribution. Our investigation 
extends our previous results (jGwinn et al.l l201ll ) to the regime of high signal-to- noise ratio 
and large interferometric phase variations. Because of pulsar gating, the number of samples 
per integration time was small and depended on the number of gated pulses within the 
integration time. From correlation functions, we determined the covariances of noise among 
spectral channels. Such covariances can be produced by quantization and by correlator 
effects. We discuss these effects and compare results with observations. 



1.1. Organization of this Paper 

Because the noise in the spectrum is drawn from a nearly Gaussian distribution with zero 
mean, variances and covariances characterize it. In Section |2j we introduce the theoretical 
basis for noise in interferometric visibility and present the mathematical descriptions used in 
the paper. In Section [3J we discuss our observations, correlation, and initial data processing. 
The remainder of the paper analyzes the noise in these observations. In Section HI we quantify 
the distribution of noise and the influence of a signal. In Section |5l we discuss quantization 
and correlator effects. In Section [61 we summarize our results and discuss implications for 
future observations and instruments. 



2. THEORETICAL BACKGROUND AND NOTATION 

2.1. Correlation Functions and Spectra 

Observations of electromagnetic radiation from astronomical sources measure and com- 
pare finite samples of electromagnetic fields. We suppose that these are drawn from ensem- 
bles of statistically-identical measurements. Noise is the difference between the result of a 
measurement and the average of an infinite ensemble of such measurements. The measure- 
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ments, as well as their statistical averages, can be expressed either as spectra, varying with 
frequency, or as correlation functions, varying with lag in the Fourier-conjugate domain. In 
practice, the samples are digitized: quantized and time-sampled. We introduce conventions 
for notation that help distinguish among these various domains. The tilde denotes entities in 
the spectral domain, indexed by the spectral channel as a subscript: for example, rv Unac- 
cented symbols designate the Fourier-conjugate domain of the correlation function, indexed 
by lag: r T . Angular brackets (...) n denote a statistical average over many realizations of the 
noiselike electric field, with the scintillation spectrum held fixed. The subscripted brackets 
(...)s denote an average over ma ny samples of the scintillation spectrum . These conventions 
are consistent with earlier work (IGwinnl I2004J : iGwinn &: Johnson! 1201 it ) . 



As in earlier studies, we supp ose that two antenn a s record time series of zero- mean, com 



plex Gaussian random variables (IGwinnl I2004J . l2006t IGwinn fc Johnson! 1201 ll : IGwinn et al. 



20 lit ). These time series can be regarded as the amplitude and phase of one polarization of 
electric field. The antennas are X and Y, and the time series xg and yg, where I indexes time. 
The ensemble-averaged cross-power spectrum fik, and the ensemble- averaged autocorrelation 
spectrum a^, fully describe the spectral properties of these series. At shorter wavelengths 
quantum-mecha nical effects introd uce shot noise, but the ensemble-averaged spectrum re- 
mains the same (jZmuidzinasI 120031 ) . For convenience, we assume unit variances for the real 
and imaginary parts. 

An observer forms the cross- correlation function, r T , of the time series xg and yg. The 
Fourier transform of r T gives the observed cross-power spectrum, f^. This observed spec- 
trum is an estimate of pk, multiplied by a correlator-dependent gain factor (see Section |5]). 
Similarly, autocorrelation of data at a single antenna yields an autocorrelation spectra a^, 
an estimate of a^. The autocorrelation spectrum may include an offset, from the corre- 
lation of spectrally-flat background noise, as well as a gain factor. In practice, individual 
measurements differ from the "true" spectrum p^ by a random amount: the noise. If 
many samples of ffe are averaged together, the noise for the average approaches a Gaussian 
distribution. 



At lag r, the cross- correlation function r T is: 



1 



Nobs - M 



(i) 



This equation parallels Equation 6 of IGwinnl (J2004J); in particular, iV b S is the number of 
samples. However, we have reduced the definition of r T by a factor of 2, by changing the 
normalization of xg and yg so that ( | | 2 ) = (\yg\ 2 ) = 1. Note that the sum here runs from 1 
only to iV bs — \t\, so that large lags are averaged over fewer samples. If the lag r < 0, then 
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the sum runs from |r| + 1 to iV bs. The finite spans of the time series prevent the correlation 
of all samples: for r > 0, no y^ +T exists for the last r samples of xg. For r < 0, no X£ exists 
for the first |r| samples of ye+ T . A pulsar gate, for example, can truncate the time series in 
this way. Equation [TJ describes the calculation of r T in many correlators of the "XF" design, 
where correlation "X" precedes Fourier transform "F". The S2 correlator at Penticton, used 
for the work described here, is of this design. In contrast, our previous theoretical work 
assumed averaging of all lags r over N nhf , elemen ts, because of the identification x^N ohs = xt 



for any I (lGwinnll2006r iGwinn fc Johnson! 1201 ll ). We call this the "wrap" assumption. Most 



correlators of the alternative "FX" design, where Fourier transform precedes correlation, 
obey the "wrap" assumption. However, by zero-padding the time series they can emulate 
the sum in Equation [TJ We discuss the consequences of these two formulations for the 
cross-power spectrum in Section 15.21 below. 

The statistical average of the cross-correlation function r T is (r r ) n = p T . This average 
runs over many realizations of the electric field, with uniform statistics. Our observations are 
not stationary and do not approximate an ensemble average because both the scintillation 
spectrum and the intrinsic flux density of the pulsar change with time. However, we can 
regard each observation as being drawn from an ensemble of observations of statistically- 



ident ical pulses at the same pulse phase and in the same scintillation state (see IGwinn et al. 



20111 ) 



The observed cross-power spectrum is the Fourier transform of the cross-correlation 
function: 

JV-l 

h = Yl ^ kr r T , (2) 

T=-N 

where the number of spectral channels is 2N. The statistical average of is (ffc) n = pk- For 
autocorrelation, the average over X(,x* l+T analogous to Equation [TJ forms the autocorrelation 
function a T . A statistical average of a T yields (a T ) n = a T . The Fourier transform analogous to 
Equation [2]yields the autocorrelation spectrum a fc , and an average over a statistical ensemble 
yields (a k ) n = a k . 



2.2. Noise Distribution for Visibility 

Noise in the cross-power spectrum is the difference of an observation and the ensem- 
ble average: ffc — (ffc) n . Typically, the observed spectrum is an average over a number of 
individually-formed spectra, as describd by Equations [TJ and [2j Consequently, the Central 
Limit theorem suggests that the noise follows a Gaussian distribution. Because the cross- 
correlation function is complex, this distribution is an elliptical Gaussian distribution in the 
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complex plane. 

When the signal is completely absent, as when the pulsar is off, one expects that pk = 
and that f & consists of noise drawn from a zero-mean, circular complex Gaussian distribution. 
Our observations match this expectation closely, as we discuss in Section 15.11 

If the signal is present, then the Dicke Equation describes the contribution of self-noise. 
This equation states that the error ST in measurements of antenna te mperature y aries with 



total system temperature T, including the contribution of the source (lDickdll946l ): 



m 2 = (3) 

Here, iV bs = Az/ x At is the number of samples, for an observed bandwid th Au and integra- 



tion t ime At. The analogous expression holds for interferometric visibility ( iThompson. Moran. fc Swenson 



19861 ). More generally, this equation describes the noise in the sample variance for draws 



from a Gaussian distribution. 

The real and im aginary part s of statistical averages of f& suffice to estimate both the 
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signal and the noise (jGwinnll2006l . Equations 11, 18, 19): 

(4) 

-axkOtYk 
Pkpk- 

Again, iVobs is the number of samples gathered, per spectral channel. The subscripted angular 
brackets (...) n indicate an average over many realizations of noise. As the first expression 
indicates, pk is the cross-power spectrum, averaged over an ensemble of statistically-identical 
realizations of noise. Analogously, a^k is the autocorrelation function at station X, and ayk 
is the autocorrelation function at Y. We assume single-sideband operation. Here we also 
apply the "wrap" assumption, discussed in Sections 12.11 and 15.2.31 The autocorrelation 
spectra are always real and are often identical between stations after calibration, with offsets 
for background noise. For a scintillating source observed on a long baseline, the spectra 
need not be identical, because the stations may lie in different scintillation elements in the 
observer plane. 

To help visualize the distribution of noise in Equation HI we divide the noise into compo- 
nents parallel with and perpendicular to the phase of the average visibility. The expression 
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for noise then takes the form: 

P = (5) 
a l = {\ & x\\&y\ + \p\ 2 ) 



2iV obs 
1 



2N, 



obs 



|ax||ay| - |p| 2 ) 



Here, is the phase of the average visibility. We have omitted the subscript k for clar- 
ity. For most interferometric observations, the intensity of the source is constant over the 
observer plane: ax = &y- The difference between a\\ and a±_ then produces an elliptical 
distribution of noise, with major axis aligned with the average phase </>. Usually, background 
noise contributes a constant offset to a, and the source contributes the rest. This form is 
quite general: it holds for any interferometric observations of a noiselike source, not just 
observations of a scintillating pulsar. 



2.3. Noise for a Scintillating Source 

2.3.1. Distribution of Noise 

A strong, scintillating source provides a good laboratory for the study of self-noise 
because it provides many independent observations of visibility, with different flux densities 
and (if the baseline is long) phases, under identical conditions. For a scintillating, pointlike 
source, with flux densities Ix and Iy at stations X an d Y, and backgrou nd noise equivalent 



to flux densities nx and riy, the signal and noise are (IGwinn et al.ll201ll . Equations 5-7): 



p = ^hJve^ (6) 
2 1 r n x n Y 



err 



" iVobs 

2 1 r n x n Y 
a — 



N, 



obs 



{ r ^ + \{n Y Ix + n x Iy)} 



The visibility phase, S , arises from phase differences between the pair of scintillation ele- 
ments in the observer plane. The variance of measurements of f at that phase is ajp the 
variance at quadrature is a\. The subscript for channel k is omitted; all quantities are for 
one spectral channel. Note that in this expression, the autocorrelation spectra at the two 
antennas axk , ®Yk differ both because the intensity of the source at the two antennas may 
differ: Ix ^ ly, and because of different noise at the two antennas: nx ^ ny. 
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2.3.2. Short- Baseline Limit 



For a short baseline, both antennas will lie within the same scintillation element. In this 
case, S — > 0, and Ia = Ib = I ■ However, the variance of the noise at the two antennas may 
still b e different. We can then express the distribution of noise in the form ( IGwinn et al. 
201 ll Equation 8): 



<y\ = b + bj + b 2 I 2 
al = b + hi 



(7) 



In this expression, 62 = 1/iVobs- This expression holds even if the source is mildly resolved 
by the scatterin g disk, because the normalized, ensemble-averaged visibility remains nearly 
1 JGwinnlhoOir i. 



2.3.3. Long-Baseline Effects 

If the baseline is long compared with the scale of the scintillation pattern in the ob- 
server plane, then the scintillations differ at the two stations. Thus, Ix 7^ iy- Nonetheless, 
inspection shows that the constant term, 60, and the quadratic term, 62, in Equation [7] still 
describe the behavior of noise correctly. The linear term, 6 1( does not. However, it converges 
to the same form in an average over many scintillation elements with the same vi s ibility p, 



for a point source. This is seen by extending the calculation of visibility in I G wind (120011 ) to 
include the intensities at the two antennas, most easily by numerical calculation. However, 
this average over realizations of scintillations converges much more slowly than the average 
over realizations of the noise. 

If the baseline is long and the source is resolved by the scattering disk, then \p\ 2 < ax&Y, 
so that |(r) n | 2 < Ixly- Consequently, a±_ will acquire some of the quadratic behavior of a\\ 
in Equation [6l For a small source size, the effect is second-order in the size parameter and 
will become a pparent only when the distribution of visibility is already significantly distorted 



flGwinnll200lh . 



2.3.4- Effects of Variability, Quantization, and Correlation 



The flux densities of pulsars in general, and the Vel a pulsar in particular, vary in- 
trinsically from one pulse to the next and within pulses (IKrishnamohan fc Downsl Il983l ; 
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Johnston et aJj|200l| ; Kramer et al.ll2002l ). Variations on timescales shorter than the time 
to accumulate one sample of the spectrum, the "accumulation time," lead to correlations of 



noise between spectral channels and increase the source noise contribution to a?, (IGwinn &: Johnson 
201ll ). If we parametrize these variations by 51 /I, with (SI) = 0, then the quadratic coeffi- 
cient, 62, in Equation [7] becomes, 



siy _L 

// + N ohs 



(8) 



T he other coefficient s, bp a nd b\, are unchanged. This calculation follows Section 3.3.2 
of IGwinn fc Johnson! (120111 ). but with /3 — 1 = 51 /I averaging to zero over the suite of 
observations rather than over the accumulation time for a single spectrum. 

Digitization, or more precisely quantization during digitization, also affects the noise. 
If the correlation is not extremely strong, p < 0.5, and if the data are viewed in the spectral 
domain, then the effects of quantization can be represented as a change in gain and a 
spectrally-constant offset. This offset is often termed "digitization noise" and contributes to 
the values of bo and bi, as one would expect. The expressions for noise take the same forms 
as in Equations [4] th rough [6] aboy e, but with corrections to station gains and noise levels (see 
Equations 56, 57 of iGwinnl 120061 ) . 



Correlations among spectral channels also characterize noise. For sources of constant 
i ntensity, nois e is uncorrelated between spectral channels, under the "wrap" approximation 
(lGwinnll2006l ). Relaxation of that assumption can lead to an observable correlation of noise 
between channels, as we discuss further in Section 15.2.31 Variation of flux density within 
the accumulation time fo r a single spectrum can also lead to significant correlations of noise 
(IGwinn fc Johnson! 1201 ll ). 



3. OBSERVATIONS, CORRELATION, AND CALIBRATION 

We observed the Vela pulsar on 10 Dec 1997 using a network comprising antennas at 
Tidbinbilla (70- m diameter), Mopra (22-m), Hartebeesthoek (26-m), and the VSOP space- 
craft (8-m). The observations began at 14:15 UT and ended at 22:40 UT, for a time span 
of 8:25. The observing wavelength was 18 cm. We observed left-circular polarized radiation. 
We recorded two 16 MHz frequency bands (IFs) at each antenna, both as upper sidebands. 
The bands spanned 1634 to 1650 MHz (IF1) and 1650 to 1666 MHz (IF2). The data were 
digitized (quantized and sampled) at recording time, thereby characterizing the electric field 
with a sign bit and an amplitude bit. The data are thus 4-level, or 2-bit, quantized. 

During the observations, the interferometer baseline from Mopra to Tidbinbilla had a 
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projected length of approximately 400 km. The baseline from Hartebeesthoek to Tidbinbilla 
had a projected length approximately 9,400 km. The baseline from the VSOP spacecraft 
to Tidbinbilla had length of approximately 27,000 km during the first period of data, from 
14:00:56 UT to 15:55:56 UT, when the spacecraft was near apogee; and of approximately 
22,000 km during the second period of data, from 20:45:25 UT to 21:13:00 UT, near the fol- 
lowing apogee. The first period spanned a longer time period and showed more homogeneous 
statistics. 



We correlated the data with the Canadian S2 VLB correlator ( iCarlson et al.lll999l ). This 
correlato r is a reduced-table 4- level correlator; in other words, the lowest-level products are 
ignored (IHagen fc Farley! 1 19 731 ). We correlated each IF separately with 8192 lags to form a 
cross-correlation function. 

We correlated the signal from the pulsar in 6 gates, synchronized with the pulsar's 
period of approximately 89 msec. Each gate was 1 msec wide. The first 5 gates covered the 
pulse. The sixth gate was located far from the pulse, where the pulsar was "off". Because 
of interstellar dispersion, each gate covered a range of pulse phases. Individual pulses also 
vary in intensity. We averaged each spectrum over a number of pulses, which reduced, but 
did not completely eliminate, this variation. We averaged the results of the correlation for 2 
sec, or approximately 22.4 pulsar periods, except on the baselines to the spacecraft, which 
we averaged for 0.5 sec, or approximately 5.6 pulsar periods. The pulsar was strong enough 
to contribute to the system temperature at the antennas; this contribution affected the noise 
through settings for the digitizers at the antennas, particularly at the most sensitive antenna. 

For a reduced-table 4-level cor relator, the o ptimal level settings are v o = ±0.90 standard 
deviations, with weighting n = 3 (jCooperlll970l ). Because the intensity of the pulsar varies 
greatly during the pulse, and because the quantizer levels were adjusted every 10 sec to 
optimal values for the previous 10-sec period, the quantizer levels ±Vq were not at this optimal 
setting in each gate. Table [1] gives the levels in Gate 1 (where the most variation occurred) 
and in the empty gate (which provides the most contrast). For Tidbinbilla, the largest and 
most sensitive antenna, the levels changed dramatically and the standard deviation was much 
greater when the pulsar was "on" because the pulsar made a large, variable contribution to 
system temperat ure. The variations are partic ularly large in Gate 1, at the leading edge 
of the pulse (see iKrishnamohan fc Downsl 119831 ). In contrast, for the VSOP spacecraft, the 
smallest antenna, the levels were the same on and off pulse, because the contribution of 
the pulsar was insignificant; the large standard deviation of Vq arose from a trend over the 
time span of the observations. For the medium-sized antennas, Hartebeesthoek and Mopra, 
the levels changed between "on" and "off" gates, but the standard deviations remained 
approximately the same. The correlator corrected for these changes in vq when estimating 
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the cross-power spectrum p; h owever, differences of vq among gates changes the properties 



of the noise in those gates (see iGwinnl 12004 120061 ). We discuss these effect in Section [5j 



The pulsar gates were so narrow that different lags accumulated different numbers of 
samples. The 8192 calculated lags spanned a time comparable to the width of a pulse 
gate: (1 msec) x (16 MHz) = 16000 complex samples; therefore, large lags accumulated 
fewer samples than small lags. The correlation function, rg, was correctly normalized by the 
number of samples contributing at each lag (Equation [T]); thus, the average of the correlation 
function (77) was the same as would have been measured with uniform sampling. However, 
the noise varied with lag. 

We Fourier-transformed the cross-correlation functions to form cross-power spectra. Be- 
cause the data were recorded in single sidebands, the cross-power spectra contained 8192 
channels with signal, each with bandwidth 1.95 kHz. The phase of the cross-power spec- 
trum included instrumental effects, primarily observational and instrumental delay s and 



rates, varying slowly with time and frequency ([Thompson. Moran. fc Swensonl Il986l ); and 
effect s of scintillation, varying more rapidly, over the timescale and bandwidth of scintilla- 
tion JPesai et al.1ll992h . 



The noise in a particular measurement of the correlation function was diluted by the 
integration time, but increased by the number of spectral channels. We accumulated 16 x 10 6 
statistically-independent complex samples per second. On Earth-based baselines, over a time 
of 2 sec, we sampled 22.4 pulse periods, for an average net integration time of 22.4 msec in 
one of our pulsar gates. Our 8192-channel spectra then contained 44 independent complex 
samples per spectral channel for each integration period. However, because of the truncation 
of the correlation function by the pulse gate discussed in Sections 12.11 and 15.21 higher lags 
accumulated approximately only half as many samples; thus, the average number of samples 
per spectral channel was approximately 33. On baselines to the VSOP spacecraft, the shorter 
integration time yielded 11 samples per channel and integration time for central lags, and 
approximately 8 samples per spectral channel. 

Tones were injected into the signal, for calibration, at Hartebeesthoek. These tones were 
separated by 511 channels (998 kHz) and were 1 or 2 channels wide. In cross-power spectra, 
they had the effect of increasing the noise in these spectral regions. We removed the narrow 
spectral regions containing these spikes from the spectra before continuing with analysis. 



We removed the average delay and rate by fringe- fitting (see lThompson. Moran. &: Swenson 



19861 ). The fit included a fringe rate in time, a delay or slope of phase with frequency, and an 
overall phase offset. For baselines between terrestrial antennas, we fringe-fit to (the central 
7168 channels in frequency) x (8 samples in time, or 16 sec), in the strongest gate, Gate 2. We 
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applied the fringe rate and delay to the other gates, but calculated the overall phase offset 
independently. The results appeared to be nearly the same as fitting such a model to other 
gates that contained strong signal, such as Gate 1; however, we prefer to use precisely the 
same model in weak and strong gates. For this paper, the primary purpose of fringe-fitting 
was to remove all instrumental effects, leaving only the effects of scintillation and those of 
statistical noise in the data. 



3.1. Typical Data 



Figure [T] shows some sample data, along with the scheme of pulse gates. To produce 
this figure, we averaged the real part of a segment of data for IF1 on the short Mopra- 
Tidbinbilla baseline, from 14:15:05 UT to 14:20:00 UT. This averaging reduced the depth of 
scintillation. We display the resulting average in the 5 gates as a function of pulse phase, 
including pulse gate and spectral dispersion. Each gate sampled the IF bandwidth over a 
short range of pulse phase; however, because of pulse dispersion, the low-frequency end of 
the gate sampled earlier parts of the pulse than the high-frequency end. The plot gives a 
rough idea of the pulse profile, although the effects of scintillation are still quite large and 
each sample is averaged over the 1 msec gate. As the figure shows, each phase in the pulse 
profile is represented twice, at two different frequencies. Each frequency is represented 5 
times, in each of the gates (as well as in the empty 6th gate, outside the plot). 

Figure [2] shows cross-power scintillation spectra. It compares the same 1-MHz spectral 
range for a single 2-sec integration, in Gates 1,2, and 6 (off-pulse). The scintillation appears 
as dramatic variations in amplitude, with large amplitudes concentrated in a few spectral 
regions. As measured from all of the Mopra-Tidbinbilla interferometric data, the typical 
scintillation bandwid th was 15 kHz (hal fwidth at half-maximum of the autocorrelation func- 
tion, in frequency) ( jGwinn et al.ll2012f ). This frequency scale is apparent in the spectra. 
Note, however, that noise modulates the scintillation peaks on finer scales and introduces 
differences in detailed shapes of the peaks between gates. The typical scintillation timescale 
was 9 sec (1/e point of autocorrelation function, in time). This timescale is longer than the 
integration time of the spectra. These scintillation scales are in good agreement with results 
extrapolated from single-dish measurements by other observers at othe r frequencies (see, 
for example, iRoberts fc Ablea Il982l : ICordes. Weisberg. fc Boriakofj|l985l ). The spectrum in 
Gate 6 appears completely noiselike: indeed, the samples are drawn from a circular Gaussian 
distribution in the complex plane, as discussed in Section 15.11 below. 
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4. 



OBSERVATIONS OF THE NOISE DISTRIBUTION 



4.1. Strategy: Noise Estimates from Differences 



4.1.1. Noise Estimates 



Differences between samples separated by much less than the scales of scintillation reflect 
noise and intrinsic variability; these differences can therefore be used to estimate properties 
of the noise. For example, consecutive samples in time can estimate both the signal (as their 
average) and the noise (as v2 times their difference). More generally, we can estimate the 
signal and noise as averages over, and differences among, groups of nearby samples. These 
estimates are unaffected by the scintillation if all of the samples are contained well within 
the characteristic frequency and time scales of the scintillation. 

Specifically, if we assume that the statistically-averaged signal s& is identical in N sam- 
ples Tk{tg), and that the noise rik,e is uncorrelated, then we can estimate both the signal and 
the noise as, 



where the index £ runs over the N samples compared. By binning the estimated noise, n^, 
according to the estimated signal, Sk, and then estimating the variance in each bin, we can 
identify the changes of the distribution of noise as a function of signal, and so identify the 3 
coefficients {&O5&I5&2} in Equation [71 

4.1.2. Effects of Scintillation, Variability, Correlation, and Binning on Noise Estimates 

The estimate of noise in Equation M does not include four effects that can contribute 
to noise: scintillation, amplitude variations, correlation of noise, and binning. Scintillation 
changes the spectrum over the scintillation bandwidth and timescale, so that differenced 
samples do not have identical averages. This change will increase the estimated variance by 
an amount proportional to the square of the signal, and so will affect estimates of the coef- 
ficient of the quadratic term, b 2 - We limit the span of averages to less than the scintillation 
time to minimize this effect. 

Scintillation also introduces differences between the autocorrelation functions at the two 
antennas, as discussed in Section [2. 3.31 above. The difference grows as the square of baseline 




(9) 



-14- 



length, divided by the scale of the diffraction pattern for short baselines. As discussed in 
Section 12.3.31 an average over many scintillation elements recovers the behavior given by 
Equation UJ However, this average over scintillation elements converges much more slowly 
than the average over realizations of the electric field used for short baselines. 



Amplitude variations of the Vela pu lsar are significant (jKrishnamohan fc Downs! 11983 
Johnston et al.ll200ll ; iKramer et all 120021 ) . Amplitude variations of the pulsar produce dif- 
ferences between spectra and thus mimic effects of noise. If the amplit ude can be estimated 
from individual spectra, the effects can be removed (IGwinn et al.ll201ll ). In the observations 
reported here, rapid variations in time appear as spectral variations because of dispersion, 
and the signal-to-noise ratio within a narrow region of the spectrum is too low to estimate 
amplitude reliably. However, the intrinsic variability of the pulsar is reduced by our averaging 
over 22 or 23 pulses. 

Correlation of noise in nearby spectral channels can arise from pulse gating, as discussed 
in Section 15.2.31 below. Larger correlations arise from variations o f intensity on timescales 



shorter than the ti me to accumulate a single sample of the spectrum (IGwinn fc Johnson!l2011 



Gwinn et al.ll201ll ). For these observations, this timescale is 8 192/16 MHz = 0.512 msec. The 
Vela pulsar shows significant variations on shorter timescales (jKrishnamohan Sz Downs 1983 



Johnston et al.ll200ll ; Kramer et all 120021 ) . so we expect these correlations to be significant. 
Therefore, we do not use differences among spectral channels to estimate noise, but only 
differences among times. (An exception is the VSOP-Tidbinbilla baseline discussed in Section 
14.5^ where low signal-to- noise ratio demands differencing in frequency as well as time.) 

Binning of the noise estimates by signal requires an accurate estimate of signal Sk- Even 
if the noise estimate is correct, it may be assigned to the wrong bin, altering the form of the 
estimated distribution. This problem is most severe when the distribution of signal varies 
within the span of the error in s/.. In this case, the noise can "leak" into the wrong bin. 
Consequently, low signal-to-noise ratio drives the analysis to few, large bins. The problem 
can be lessened by integrating over a longer time interval, so that the average is better 
determined. These intervals must be smaller than the scales of variations from scintillation, 
which can mimic noise. 

Alternative analyses, involving a model for the distribution of flux density of the scintil- 
lating source, avoid use of bins completely and so eliminate "noise leakage." Such analyses 
can estimate all paramete rs, including noise parameters, simultaneously (as in IGwinn et al. 
20121 ; iJohnson et al.ll2012l ). These methods are also insensitive to correlations between spec- 
tral channels. However, the differencing and binning analysis presented here allows straight- 
forward visualization of the noise distribution, as presented in Sections l4.2l through l4.5l below 
without any assumption about the underlying distributions of visibility or noise. 
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4.2. Observations: Noise in the Complex Plane 

The interferometric visibility is complex, and the distribution of noise varies with it 
over the complex plane. Figure [3] shows estimated noise as a function of average visibility 
in the complex plane, for the Hartebeesthoek-Tidbinbilla baseline. This baseline was long 
enough to span much of one scintillation element, so that the interferometric phase varies 
over 27T. Moreover, the antennas are large, so the resulting moderate signal-to-noise ratio 
allows the complex plane to be divided into many bins. Note that although the distribution 
extends over 2n in phase, it is concentrated toward the right: the average visibility lies on 
the positive real axis. 

We used data from IF2, Gate 2, channels 2048-3072, for the entire time span of the 
observation on that baseline. We formed spectra, and fringed the data, as described in 
Section[3]above, to align the average visibility with the positive real axis. We then differenced 
pairs of consecutive time samples to find the estimated noise and signal Sk- The data were 
binned by real and imaginary parts of the average signal with bin increments of 0.015 
correlator units. For bins containing more than 100 samples, we found error ellipses from the 
standard deviation of our estimates of the complex noise nk in that bin. The figure shows 
these ellipses. The displayed ellipses extend to one-half standard deviation of the noise in 
each bin, to reduce confusing overlap. 

The error ellipses have the form that Equation [7J suggests, with size increasing with 
distance from the origin, indicating increasing noise with increasing signal amplitude. El- 
lipses close to the origin are nearly circular, as they must be: the noise is independent of 
signal phase, when signal amplitude is near zero. Both dimensions of the error ellipses grow 
with increasing signal; however, the noise in phase with the signal grows faster, so that the 
ellipses become elongated further from the origin. Of course, amplitude variations in time, 
between pulses, would produce the same effect. However, the contribution of such amplitude 
variations is small for our 2-sec averages. 

4.3. Observations: Noise on a Short Baseline 

Figure [3] and the form of Equation [7] suggest that we find noise parallel with and perpen- 
dicular to the signal. Thus, as long as we consider perpendicular and parallel components 
separately, we can group together noise estimates with the same magnitude of visibility. 
Figure H] shows this analysis for IF2, Gate 1 Channels 5120-6144 on the Mopra-Tidbinbilla 
baseline. This short baseline provided a long observation, yielding many individual mea- 
surements of noise. The chosen gate and channel range are near the pulse peak, where the 
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source was strong and signal-to-noise ratio is high, providing for many, relatively narrow 
bins. As in Equation |9l we averaged the signal over 4 samples, or 8 sec. We then found noise 
by differencing individual samples from the average. For each bin in signal amplitude, we 
present histograms of noise in phase with the signal and at quadrature. 

As the figure shows, noise was equal in the two directions at zero signal, at lower left, and 
increased with signal amplitude to the right and upward. Moreover, noise increased more in 
phase with the signal, than in quadrature with it, so that the widths of the two distributions 
in each panel diverge as signal increases. The number of points decreased with increasing 
signal as well, representing the effect of the underlying distribution of visibility. The vertical 
scales are logarithmic, so that a parabolic shape indicates a Gaussian distribution. We do 
not display fits of Gaussian distributions to the histograms in the figure because these are 
nearly indistinguishable from the histograms. 

Figure [5] shows the variances of the best-fitting Gaussian distributions to noise, for the 
data shown in Figure HI The noise in phase with the signal (shown by circles in the figure) 
increases quadratically with signal amplitude, whereas noise in quadrature with the signal 
(shown by crosses) increases linearly. This is precisely the behavior expected, as discussed in 
Section [2T21 above. We fit these two curves with polynomials of the form given by Equation 
[0 We assume that the coefficients (except the quadratic coefficient b 2 ) are the same for erjj 
and a\. The fit is to points in the range 0.006 < I < 0.056. Points at larger / are based on 
a small number of samples. In the lowest bin, noise is slightly higher than expected in both 
components, compared with extrapolation from larger bins and the fit. The noise increases 
in all directions away from this bin, so that every point "leaked" from an adjacent bin tends 
to increase the noise, as discussed in Section 14.1.21 above. The y-intercept of the fit is close 
to the noise level estimated for the empty spectral range of this gate, discussed in Section 
15.11 below. The figure also shows, a s a dotted line, res ults for noise in a fit to the global 



distribution of visibility discussed in iGwinn et al.l ( 120121 ). The two techniques usually agree 



well, for gates and spectral ranges with high amplitude, so that leakage of noise into adjacent 
bins is small, and the distribution can be characterized well. 

As a measure of the quality of these fits, we note that in the absence of self-noise, all 
16 points in Figure H] would have a single value b . The mean square residual about the 
best-fitting single value is 6.5 x 10 _1 . Our model included two additional parameters, b\ 
and 62, and reduced the mean square residual to 2.4 x 10~ 3 , or by a factor of 268. The 
F-ratio test gives the likelihoo d of this improvement occurring by chance of less than 10~ 12 



( jBevington fc Robinson! 12003 ). Thu s, the 3-parameter model is excellent. If we adopt the 



parameters from lGwinn et al.l (120 121 ) shown by the dashed lines in Figure HI the mean square 



residual is 3.3 x 10 3 . 
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The difference between mi and a±_ reffects self-noise and effects of amplitude variations. 
The contribution of self-noise was l/iV obs = 1/33, as discussed in Section [3j As Equation [8] 
shows, amplitude variations on timescales longer than the accumulation time, approximately 
1 msec, and shorter than the 2-sec integration time, contributed to For the data in the 
figure, we found b 2 = 0.12. This value indicates that (SI 1 1) 2 = 0.09, as would be expected 



after integrating ov er 22 or 23 pulses ( jKrishnamohan fc Downs! Il983l ; iJohnston et al.l 12001 



Kramer et all 120021 ). 



4.4. Observations: Noise on an Intermediate Baseline 

Figure |6] shows the behavior of noise on the Hartebeesthoek-Tidbinbilla baseline. This 
baseline had projected length of over 9,000 km during the observation; however, it is interme- 
diate in length in the sense that it is comparable to the length scale of the scintillation pattern. 
Moreover, the baseline was short compared with the baseline to the VSOP spacecraft. We 
included data from the entire time span recorded, in IF2, Gate 1, channels 5120-6144. This 
is the same gate and frequency range as shown for the Mopra-Tidbinbilla baseline in Figure 
|5j This range was chosen for comparison, and because Gate 1 allows comparison with an 
empty region of the gate. We fit these two curves with polynomials of the form given by 
Equation to points in the range 0.006 < I < 0.056. Our 3-parameter noise model reduced 
the mean square residual over the 10 points in the fit, relative to a single variance, by a 
factor of 1112. The F-ratio test gives probability of this occurring by chance of less than 
5 x 10~ 12 . 

Comparison of Figures [5] and [6] shows interesting consequences of the longer baseline 
length. The maximum amplitude was smaller, because as the baseline length approaches the 
scale of the scintillation, occurrences of high intensity at both stations become less likely. 
The contribution of background noise 60 was smaller and the self-noise was greater, perhaps 
because Hartebeesthoek has larger area than Mopra (26 m rather than 22 m diameter). 

The points at high amplitude show scatter about the fitted curves; for this intermediate 
baseline, the average for b\ converges over some timescale intermediate between the many 
samples of electric field for a short baseline, and the many scintillations required for a long 
baseline. The first bin is elevated relative to the extrapolated, fitted curves, and the y- 
intercept of the fitted curves lies above the noise found in the empty part of the gate. This 
may result from leakage of noise adjacent bins, and the different convergence statistics. 



-18 - 



4.5. Observations: Noise on a Long Baseline 

As an example of noise on a long baseline, we analyze the long baseline from the VSOP 
spacecraft to Tidbinbilla, using procedures similar to those for the short Mopra-Tidbinbilla 
baseline. On this long baseline the phase of the cross-power spectrum varied through many 
turns because of scintillation. Instrumental phase variations were larger and more rapid than 
on the Mopra-Tidbinbilla baseline discussed above, and of course the 8-m spacecraft antenna 
is smaller than Mopra. Fringe-fitting the data, as described in Section [3j was challenging 
because of the rapid variation of phase and rate from spacecraft motion, the large variations 
of phase with scintillation and the low average visibility. 

For analysis of noise, we used fewer, larger bins in visibility so that the noise in a bin 
does not greatly exceed bin width. For our tests, we used data from the first orbit, from 
IF1. We used Gate 2, which is near the peak of the pulse and has high flux density across 
the observing band, to maximize signal and so ease fringe-fitting. To improve the quality of 
the noise measurement, and to reduce leakage into adjacent bins, we found averages for the 
real and imaginary parts of the visibility for spans of 4 channels in frequency, and 8 samples 
in time (7.8 kHz x 4 sec). The frequency span lies well within the scintillation bandwidth of 
15 kHz, and the time span within the scintillation timescale of 9 sec and the fringing time 
of 16 sec. 

The noise is different in phase and in quadrature with the signal. Figure [7] shows the 
variances <j\\ and a± plotted with the averaged magnitude of the visibility \sk\. We fit to the 
range 0.005 < < 0.020. Again, quadratic and linear models, with linear terms identical, 
fit the data well for small signal amplitude. The 3-parameter model reduces the mean square 
residual by a factor of 31, relative to a single-parameter model. The F-test gives probability 
of chance occurrence of less than 6 x 10~ 5 . 

The quadratic term is small compared with the linear term, in comparison with the 
other baselines, as expected for the lower signal-to-noise ratio for this less-sensitive baseline. 
The point at smallest amplitude shows considerably higher noise than that extrapolated from 
larger amplitudes, again most likely from 'leakage" of noise from other bins into this bin. 
As discussed in Section I2.3.3[ the noise converges to the expected form only in an average 
over many scintillation elements. This slow convergence, and the relatively small number of 
samples at large visibility, contribute to the variation of the points about the expected form. 
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5. ANALYSIS: CORRELATOR EFFECTS 
5.1. Statistics of Noise in an Empty Gate 

5.1.1. Variance: Short Baseline 

In spectral regions empty of signal, the average value of the cross-power spectrum is zero, 
and the noise closely approximates a Gaussian distribution. Figure [8] shows two examples 
for the Mopra-Tidbinbilla baseline: the distribution of the real and imaginary part of 
for IF1 channels 1024 to 2048 during the period 19:08:17 UT to 21:12:52 UT, in Gates 1 
and 6. Gate 6 was correlated only for IF1. The statistics of the real and imaginary parts 
are nearly identical, as expected for the noise, as the figure shows. The noise is clearly 
smaller in Gate 1, when the pulsar is "on" in another part of the gate. The variances are 
58.3 x 10 -6 correlator units 2 for Gate 1, and 96.5 x 10~ 6 correlator units 2 for Gate 6. 



The change in noise is an artifact of digitization ( iGwinnl 120061 ) . The reduction is the 
consequence both of the change in autocorrelation spectrum a and of the different levels 
of the quanti zer relative to the standard deviation of the signal (Tabled]). Equation 56 of 



Gwinnl (120061 ) gives the noise in the absence of signal: 



2N 

bo = Fc^r- ( A *2 + B ^ ~ !)) + B Y (& k - 1)) , (10) 

JVobs 

T his expr e ssion relates the notation of this paper on the left side of the equation, with that 



of IGwinnl ( 120061 ) on the right. Evaluation requires the autocorrelation spectrum subject 



to the normal ization condition: Y2 a k = 1> where the sum runs over the 2N spectral channels 



( IGwinnl 120061 . Equation 1). Effects of the change in the threshold for the quantizer vq are 
contained in the constants Ax2, Bx for station X and Ay2, By for Y. These quantities can 
be calculated from the statistics of the reduced-table 2-bit correlator with n = 3 and the 
values for vq given in Table [TJ The correlator-dependent gain Tq parametrizes instrumental 
effects. The number of spectral channels is 2N and the number of observations is N Q ^ S . For 
our observations, on a short baseline, « 33 as discussed in Section [3j 

For Gate 6, empty of any pulsar flux, the autocorrelation function was flat, and S/. = 1. 
When the pulsar turns on in part of the spectrum, the autocorrelation function in the off- 
pulse portion must fall, because at is normalized. This accounts qualitatively for the reduced 
noise level in the empty portion of Gate 1 relative to the completely-empty Gate 6, although 
changes in Vq also play a role. 

A quantitative calculation of the autocorrelation spectrum when the pulsar is "on" 
is necessarily indirect. We estimated the relative background noise of the antennas from 
tabulated system-equivalent flux densities, and used the average cross-power spectrum in 
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Gate 1 (a smoothed, larger average similar to that shown in Figure [T]) to estimate the 
contribution of the source to the spectrum. Using the normalization condition, we estimated 
&h in regions of the spectrum without pulsar flux. We used the tabulated average values 
of Vq and the correlator parameter n = 3 to determine the constants Ax2i Bx, Ay2, and 
By, for Gates 1 and 6. Equation [TU1 then provided the noise in the two cases; their ratio is 
independent of Fc- We found an expected ratio of 1.23 of the standard deviation of noise 
in Gate 6 to that in Gate 1. This is in approximate agreement with the measured ratio of 
1.29, from fits to the histograms shown in Figure [U A more sophisticated calculation might 
include use of the autocorrelation spectra for the two antennas, and quantizer populations 
measured synchronously with them; it might also include a spectral model for noise with 
frequency at each antenna. 



5.1.2. Noise and Number of Pulses 

The observed distributions of noise are superpositions of underlying distributions. For 
example, not all integration times contained the same number of samples. On the Mopra- 
Tidbinbilla baseline, the signal was integrated for 2 sec, or 22.4 pulse periods; more precisely, 
61% of the integrations contained 22 pulses, and 39% contained 23. Although the data are 
normalized to the integration time, the variance of the noise, parametrized by bo, will be 4% 
greater for the integrations with fewer pulses. 

Of course, the number of pulses was the same for all channels within a given sample 
of the spectrum. A spectral average provides an estimate of variance. We demonstrate the 
effect of integrating over different numbers of pulses in Figure [9J We plot a histogram of 
the variances over channels 1024 to 2048, for samples from the time span of the data in 
Section T4.3[ in IF1. The histogram shows 2 clear peaks. The centroids of the peaks lie at 
horizontal positions close to the expected ratio of 22:23. Their populations are close to the 
expected ratio of 61:39. The vertical lines show these positions, constrained to match the 
overall variance of the data set. The dotted line shows a simple fit of a model for the sum 
of two Gaussian distributions to the two peaks, with the normalizations and locations of the 
peaks set as for the lines, with net normalization equal to the number of samples, and the 
widths of the peaks equal. 
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5.1.3. Variance: Long Baseline to Spacecraft 

We analyzed noise for the long baseline from the VSOP spacecraft to Tidbinbilla, by 
examining the distribution of noise in individual channels and time samples. The much 
smaller number of pulses per 0.5-sec integration on the VSOP-Tidbinbilla baseline had a 
much greater effect on the noise, as Figure M shows. For this figure, we used IF1, in the 
empty Gate 6, for the data for Orbit 1. Comparison of the pulsar period with the integration 
time indicated that 40% of samples contain 5 pulses, and 60% contain 6 pulses. Again, the 
vertical lines under the two peaks show their expected locations, constrained to match the 
overall variance of the data set. Locations are in the expected 5:6 ratio, and populations 
40:60. We again show a fit of a model for the sum of two Gaussian distribution to the two 
peaks. In this case, the widths of the peaks are proportional to the squares of the variances, 
as expected for purely statistical contributions to the widths of the peaks. 

We found that noise in Gate 6 from the long baseline was nearly Gaussian, using an anal- 
ysis similar to that in Section I5TT1 We separated the noise samples into two groups, according 
to the variance. The variances of the two groups are 3.44x 10 -4 and 4.12x 10~ 4 (correlator units) 2 , 
respectively. The variance of the resultant distribution is a weighted sum of the two: 
3.71 xl0~ 4 (correlator units) 2 . In the empty portion of Gate 1, the statistics are similar, 
but with smaller resultant variance: 3.10xl0 -4 (correlator units) 2 , for channels 1024 to 
2048. As for the Mopra- Tidbinbilla baseline, the decrease in noise arises from the presence 
of signal elsewhere in the band. Using the method discussed in Section I5.1.1[ we estimated 
the expected ratio of the standard deviations of noise of in Gate 6 to that in Gate 1 to be 
1.19, for the VSOP-Tidbinbilla baseline. This expectation is in reasonable agreement with 
the measured ratio of 1.20. 



5.2. Distribution of Noise with Correlator Lag 



Variances and covariances completely characterize Gaussian nois e. Covariance of noise in 
different spectral ch annels can arise from quantization (IGwinnll2006[) , from source variability 
on short timescales (IGwinn &: Johnsonll201ll : iGwinn et al.ll201ll ). and from pulsar gating, as 
described in the following section. In this section, we calculate the covariance for noise in 
an empty gate and compare with observations. We then discuss how, for gates containing 
signal, the covarance introduced by source variability masks that from pulsar gating. 
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5.2.1. Pulsar Gate, Without Wrap 

Pulsar gating can introduce spectral c orrelations. Indeed, any temporal modulation is 
predicted to introduce spectral correlation (jGwinn &: Johnson! 1201 11 ). For our observations, 
the maximum lag correlated approaches the width of a pulse gate. Therefore, at large lag, we 
obtain fewer measurements of the correlation function than at small lag. The covariance of 
noise between two spectral channels is given by Fourier transform of the product of elements 
of the correlation function. The Fo urier transform of two copies of Equation [1] leads to an 
expression similar to Equation 22 of iGwinnl (120061 ): 

JV-l 



\hr* k+£ )-(h)(r* k+i ) = Yl 
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V,fl= 



N (N obs -\v + fj,\)(N ohs -\v\) n m 
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Note that, in our case, the limits of the sums over n and m depend on both v and fi. This 
is because the correlation functions r r ,r* are not averaged over all of the samples at both 
stations. The correlation of noise between spectral channels given by Equation [TT] depends on 
the specific form of the autocorrelation function, a T . However, the autocorrelation function 
is always maximum at a = 1 for lag r = 0, and usually falls rapidly to zero for larger lags 



T. 



5.2.2. Correlation of Noise: White Signal and White Noise 
We suppose in t his section that the original time series is spectrally-flat or "white" 



noise (jPapourislll99lf ). Consequently, the correlation function is zero except at the central 



lag: ao = 1, and a v — for v ^ 1. We then find: 

(r v r* v )-(r v )(r* v )= 1 (12) 
N ohs - \v\ 

Because we have normalized the correlation function and the signal is white, the mean square 
noise is just the reciprocal of the number of samples. 

In the spectral domain, this variation of the number of samples leads to correlation. In 
Equation [Til the product of a's is zero unless m = n and fi = 0. Performing the remaining 
sums, we find for the correlation between channels: 

N-i , 
(hr*k +i ) ~ (h) (fl + i) = E e»^H ■ (13) 

at -''obs ^ 

This correlation is largest for close spectral channels [I 2iV) because the noise in the 
correlation function is largest for large lag (v — >■ N). 
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5.2.3. Observed Correlation of Noise in an Empty Gate 

We compared observations on the Mopra-Tidbinbilla baseline with the theoretical pre- 
diction of Equation [121 an d find quantitative agreement. Figure [10] shows the results. To 
make the figure, we re-transformed the spectrum for each time interval, and found the mean 
square correlation function (r T r*). The data used to make Figure [TU1 are from Gate 6, IF2, 
of the Tidbinbilla-Mopra baseline, from 14:15 to 15:41 UT. We used the full recorded band- 
width. The downward-pointing spike at r = 5000 apparently results from interference. The 
solid curve shows the form predicted by Equation [12] as expected for iV bs = 16,000 complex 
samples, for the 1-msec length of the pulsar gate and our 16 MHz bandwidth. 

Figure [10] shows less noise at small lag, as Equation [121 suggests. From Fourier transform 
of the theoretical curve shown in the figure, we find that noise in the spectrum fk is anti- 
correlated in adjacent channels by approximately —6%, expressed as normalized correlation. 
This correlation falls off rapidly, however: it is only +0.4% in the second channel, and —0.7% 
in the third, and so on. 

5.2.4- Correlation of Noise: Signal Present 

If the spectrum is not "white", then Equation [TTl still holds, and cuo = 1. For typical 
spectra, we expect that correlations away from the zero-lag will be small: \a T \ <C 1, for 
r ^ 0. 

If all of the features of the spectrum are fully spectrally resolved, then any nonzero 
values of a T will be concentrated in a range close to r = 0, and the effect of the reduced 
denominator on the right-hand side of Equation [11] will tend to increase the noise at large 
lag r. On the other hand, if the spectrum is not fully resolved, some nonzero values of a T 
will be missing from the sum, which will tend to decrease the noise at large lags. Both 
effects are larger for higher lags r and, thus, will tend to introduce small-lag correlations 
in the conjugate spectral domain. The first will introduce negative correlations, the second 
positive correlations. For well-resolved spectra, the first effect will predominate, and will 
likely produce anticorrelations comparable to those estimated for white spectra and noise. 
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6. CONCLUSIONS 

6.1. Summary of Results 

We compare theoretical predictions for the distribution of noise for cross-power spectra 
with observations of a scintillating pulsar, the Vela pulsar. We describe observations made 
with Earth-based VLBI baselines, and with baselines from an orbitin g; spacecraft to Earth. 



These observations extend previous studies (IGwinn fc Johnson! 120 111 ) to the regime of high 



signal-to-noise ratio and large variations in interferometric phase. 

In Section [2], we argue that, in the presence of signal, noise on a short baseline should 
be drawn from an ellip tical G a ussian dist ribution in the complex plane. The theory was 



previously presented in IGwinn! ( 120061 ) and IGwinn fc Johnson! (120111 ). The major axis of the 



distribution is aligned with the direction of the signal. The variance along the minor axis 
depends linearly on signal strength; the variance along the major axis has the same linear 
dependence, plus a quadratic term. At zero signal, the major and minor axes are equal and 
the distribution of noise is a circular Gaussian, as for a gate or spectral region empty of 
signal. 

We test this theory with observations on the baselines from Mopra, Hartebeesthoek, 
and the VSOP spacecraft to Tidbinbilla in Section HI We estimate noise by comparing 
samples within the characteristic scales of the scintillation, and binning their differences 
by average interferometric visibility. We find that the distribution of noise closely follows 
the expected elliptical Gaussian form for each visibility, and the scaling with visibility of the 
major and minor axes corresponds to the quadratic and linear noise polynomials, respectively. 
The quadratic coefficient accurately reflects the number of samples and the contribution 
of intrinsic amplitude variations; the constant coefficient agrees with that estimated from 
empty portions of the spectrum for the Mopra-Tidbinbilla and Hartebeesthoek-Tidbinbilla 
baselines. 

In Section [5j we demonstrate that quantization, gating, and integration each affect the 
properties of the noise. One interesting consequence is that the noise in the presence of 
signal is less than that in a completely empty spectrum - a result of the combination of 
quantization and spectral variations. Also, pulsar gating leaves fewer samples, thus larger 
noise, at larger lags; this effect incurs correlations in the spectral noise. In principle, complete 
knowledge of the quantizer levels for each integration period, and of the autocorrelation 
functions at the two antennas, allows calibration of these effects. Alternatively, recording the 
signal with many quantizer levels increases the dynamic range, and reduces effect s of variation 



in qu antizer levels. Flexible software correlators, such as the DiFX correlator (IDeller et al. 



20071 ). can control these artifacts, while Nyquist-sampled spectra of individual pulses obviate 
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the d ifficulties in characterizing inhomogeneities within the integration (IJohnson &: Gwinn 
2012h . 



6.2. Self-Noise for an Interferometer 

We present observations for a scintillating pulsar, but the effects of self-noise hold for 
any interferometric observation. A careful evaluation of these effects is essential for a priori 
estimates of the accuracy of pulse-timing and spectroscopy using single-dish observations, 
and for scintillation studies and astrometry using interferometry. 

Many telescopes now under construction or being planned, such as LOFAR, ASKAP, 
SKA, and so on will operate as interferometers, with many baselines among many antennas 
of a particular design. Each baseline will have the distribution of noise we describe above 
in Section [2J and as we observe for the Vela pulsar. Because each telescope in a large 
array receives precisely the same noiselike signal from the source, increases in the number of 
antennas and averaging of many baselines do not change self-noise, when expressed in terms 
of flux density. However, averages over many baselines do decrease the background noise. 
Stated in terms of the notation introduced in Section [21 increasing the number of identical 
antennas Na reduces 60 as N^ 2 and b\ as N^ 1 but does not change 62- For true "tied-array" 
operation, where electric fields from all antennas are phased a nd summed before correlation, 



statistics are those of a single dish, ( iGwinn fc Johnson! |20 111 Equation 11). As the number 



of antennas become larger, self- noise becomes more important. When the source dominates 
the system temperature, further improvements demand more samples iV b s , as produced 
by wider bandwidth or longer integration time: a greater aperture does not provide more 
accuracy. Our expression Equation generalizes this result to interferometry. 

Astrometry depends on measurements of interferometric phase. Equations [5] and [7| show 
that the maximum attainable phase accuracy is 5(f) ~ a±/s ~ y/n/ (N \, s s) , or the inverse 
square root of the signal-to-noise ratio s/n, divided by y/N b s . Similarly, the maximum at- 
tainable accuracy in measurement of flux density by an interferometer, or a large single dish, 
is approximately the flux density of the source divided by y/N ohs . Likewise, the maximum 
attainable accuracy in pulsar timing is approximately the width of the narrowest feature in 
the profile, divided by the signal-to-noise ratio and by V^obs- However, when self-noise is 
the limiting factor, the maximum attainable accuracy is simply the width of that feature, 
divided by V^obs- 
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Fig. 1. — Gates across the pulse. Lower panel: Averaged cross-power spectrum for the 
Mopra-Tidbinbilla baseline, in IF1, displayed as a function of pulse phase. Upper panel: 
Gates displayed with pulsar phase and frequency. Gates are defined in frequency and time, 
and appear as parallelograms in this plot because of dispersion. Each point in the lower plot 
shows one spectral channel, at the center of 1 msec of pulse phase. Spikes show scintillations 
during the 5-min observing period. Dips in the pulse profile show rolloff of gain at the edges 
of the frequency band, in each gate. 
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Fig. 2. — One frequency range of cross-power spectrum for the Mopra-Tidbinbilla baseline 
in Gates 1, 2, and 6. Spectra are averaged over 2 sec starting at 15:43:00 UT. The first two 
gates show nearly the same spectrum, to within an overall factor, because they sample the 
same time and frequency interval of the scintillation pattern. Gate 6 contains only noise. 
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Fig. 3. — Distributions of noise, in bins of signal in the cross-power spectrum, in IF 2 Gate 
2 Channels 2048 to 3072 on the Hartebeesthoek-Tidbinbilla baseline. For each bin in real 
and imaginary part of average signal, the ellipse shows one-half the standard deviations of 
noise within that bin. Ellipses are plotted for bins that contain iV > 100 noise values. Labels 
are log 10 N. 
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Fig. 4. — Distribution of noise in phase with the signal (solid) and in quadrature (dotted), 
for bins in amplitude of the signal, in IF 2 Gate 1 Channels 5120 to 6144 on the Mopra- 
Tidbinbilla baseline. Noise was found by finding differences from the averages of 4 samples 
in time. Numbers at top right of each panel show the amplitude of the signal at the center 
of the bin. Units are correlator units. 
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Fig. 5. — Variances of noise in phase with the signal (circles) and at quadrature (crosses) 
estimated from differences of channels from the average of 4 consecutive samples in time. 
Data are from IF 2 Gate 1 Channels 5120 to 6144 on the Mopra-Tidbinbilla baseline. Solid 
curves show best-fitting line and parabola to the points, with the fit demanding the same 
linear terms for both. Leftmost and rightmost points we re excluded from th e fit. The dashed 
curve shows the curve from a fit to the full distribution (IGwinn et al.ll2012l ). The heavy tick 
on the y-axis shows noise in the spectral region of Gate 1 empty of signal (see Figure |8]). 
Units are correlator units. 
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Fig. 6. — Same as Figure EJ but for the Hartebeesthoek-Tidbinbilla baseline, for IF 2 Gate 
1 Channels 5120 to 6144. Again, the heavy tick on the y-axis shows noise in the spectral 
region of Gate 1 empty of signal. 
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Fig. 7. — Same as Figure[5l but the VSOP-Tidbinbilla baseline, and in IFlGate 2 in channels 
1024 to 3079, for Orbit 1 near apogee, for 14:52:16 to 15:19:49 UT. Averaged signals from 
(4 channels in frequency) x (8 samples in time). 
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Fig. 8. — Distribution of noise with zero signal for the Mopra-Tidbinbilla baseline in Gate 
6 Gate 1 in IF1, channels 1024 to 2048. Solid lines show real parts of cross-power f^, dotted 
lines show imaginary parts. Gate 6 is empty: the pulsar is "off". Gate 1 contains pulsar 
emission at higher frequencies, but is nearly empty in this spectral range. Solid curve shows 
fit for a Gaussian distribution to the histogram, for Gate 6. 
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Fig. 9. — Distribution of variance of noise in the empty Gate 6, averaged over the spectrum, 
plotted as a histogram. The vertical lines show the expected horizontal locations of peaks 
given by the numbers of pulses within an integration time: 22 and 23 for the upper panel 
(Mopra-Tidbinbilla baseline) and 5 and 6 for the lower panel (VSOP-Tidbinbilla baseline). 
Vertical extents of the lines show the expected relative populations of the two peaks. The 
dotted lines show a fit for the superposition of 2 Gaussian distributions to the histograms. 
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Fig. 10. — Correlation function of noise in empty Gate 6 on the MT baseline, plotted with 
lag r. Points show measured mean-square correlation function. Solid line shows theoretical 
curve for effects of pulse gating. A single parameter, the overall scale, has been fit to match 
theory to observation. 
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Table 1. Quantizer Levels: ±v 



On Pulse: Off Pulse: 

Gate 1, IF1 Gate 6, IF1 



Station 


Average 


Std. Dev. 


Average 


Std. Dev. 


Hartebeesthoek 


0.928 


0.007 


0.944 


0.007 


Mopra 


0.933 


0.005 


0.944 


0.005 


Tidbinbilla 


0.843 


0.017" 


0.947 


0.004 


VSOP Spacecraft 


0.923 


0.014 6 


0.923 


0.014 b 



a Standard deviation reflects pulse-to-pulse variations in inten- 
sity. 

b Standard deviation reflects primarily a slow drift over the time 
span of observations. 



